The Twitter Users of South Korea dataset contains the following parts:

1. edge_list_encry.gz   This dataset contains the edge list of the South Korean Twitter user following network. To protect users' privacy, the user IDs in the network are encrypted.

2. ego_profile.csv   This dataset contains the profile information of 2.59 million users (number of followers, number of followees, number of statuses). It can be used to recreate the distributions of user profile attributes reported in the study.

3. user_id.csv    This dataset contains the IDs of 2.31 million users who are linked with other South Korean users in the network. The user IDs provided by Twitter can be used to identify a user and their tweets through the Twitter APIs.

4. social_bot_by_botornot_account_32862.csv   This dataset contains 30,000+ random users detected by BotOrNot.

5. korean_twitter_user_transitivity_score.csv This dataset contains the transitive clustering coefficient for the directed network of South Korean Twitter users.

6. topic distribution.csv  This dataset contains the topic distribution, based on the timelines of a random sample of 10,000 users.

7. circadian_gb.csv  This dataset contains the grouped (group-by) data on circadian rhythm patterns.
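
The gzipped edge list in part 1 can be streamed without unpacking it to disk. The following is a minimal sketch only. It assumes one edge per line with two fields per line, separated by whitespace; the actual delimiter, the presence of a header row, and the meaning of the field order (who follows whom) are not stated here and should be checked against the file. The function names iter_edges and degree_counts are hypothetical helpers, not part of the dataset.

```python
import gzip
from collections import Counter


def iter_edges(path, sep=None):
    """Yield (first_id, second_id) pairs from a gzipped edge list.

    Assumption: one edge per line, two fields per line. sep=None splits on
    any whitespace; pass sep="," if the file turns out to be comma-separated.
    """
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            fields = line.strip().split(sep)
            if len(fields) < 2:
                continue  # skip blank or malformed lines
            yield fields[0], fields[1]


def degree_counts(edges):
    """Count how often each encrypted id appears in each of the two fields."""
    first, second = Counter(), Counter()
    for first_id, second_id in edges:
        first[first_id] += 1
        second[second_id] += 1
    return first, second
```

For example, degree_counts(iter_edges("edge_list_encry.gz")) returns two counters keyed by encrypted ID. If the file has a header row, it will be yielded like any other line and should be skipped by the caller.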




Notes:
1. In compliance with Twitter’s policy and to protect users' privacy, we provide the user IDs and the profile data only as separate files. The user IDs provided by Twitter can be used to identify a user and their tweets through the Twitter APIs.
2. Python and the pandas library can be used to wrangle the data once the tweets have been rehydrated.
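
As a starting point for the wrangling mentioned in note 2, the sketch below loads one of the CSV files with pandas and summarizes its numeric columns. It makes no assumption about column names, because the headers are not listed here; load_table and numeric_summary are hypothetical helper names introduced for illustration.

```python
import pandas as pd


def load_table(path, **read_csv_kwargs):
    """Load a CSV file and print its shape and column names."""
    frame = pd.read_csv(path, **read_csv_kwargs)
    print(f"{path}: {frame.shape[0]} rows, {frame.shape[1]} columns")
    print("columns:", list(frame.columns))
    return frame


def numeric_summary(frame):
    """Describe every numeric column (count, mean, std, quartiles, max)."""
    return frame.select_dtypes(include="number").describe()
```

A call such as numeric_summary(load_table("ego_profile.csv")) would give a first look at the follower, followee, and status counts. Extra keyword arguments are passed straight to pandas.read_csv, so an ID column can be kept as text with dtype={"user_id": str}, where "user_id" is a placeholder for whatever the real column is called.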